LARGE DEVIATIONS FOR RANDOM POWER MOMENT PROBLEM

By Fabrice Gamboa and Li-Vang Lozada-Chang

Université Paul Sabatier and Universidad de la Habana

We consider the set $M_n$ of all $n$-truncated power moment sequences
of probability measures on $[0,1]$. We endow this set with the
uniform probability. Picking randomly a point in $M_n$, we show that
the upper canonical measure associated with this point satisfies a
large deviation principle. Moderate deviations are also studied,
completing earlier results on asymptotic normality given by Chang,
Kemperman and Studden [Ann. Probab. 21 (1993) 1295-1309]. Surprisingly,
our large deviations results allow us to compute explicitly the
$(n+1)$th moment range size of the set of all probability measures
having the same $n$ first moments. The main tool to obtain these
results is the representation of $M_n$ on canonical moments [see the
book of Dette and Studden].



1. Introduction. In this work we study the large deviations behavior of
the random power moment problem. Let $P([0,1])$ denote the set of all
probability measures (p.m.s) on the interval $[0,1]$. Throughout the
paper this set is endowed with the weak topology [see Billingsley (1999)].
For any $\mu \in P([0,1])$ the $k$th (power) moment of $\mu$ will be denoted by $c_k(\mu)$:

$$c_k(\mu) = \int_{[0,1]} x^k \, d\mu(x).$$

In this paper we focus on some asymptotic properties of the finite moment
space $M_n$:

$$M_n = \{c^{(n)}(\mu) = (c_1(\mu), \ldots, c_n(\mu))^T : \mu \in P([0,1])\}, \qquad n \in \mathbb{N}^*.$$

$M_n$ is the closed convex hull of the curve

$$\{(x, x^2, \ldots, x^n)^T : x \in [0,1]\}$$

[see Karlin and Studden (1966) and Krein and Nudelman (1977)]. As $M_n$ is
a compact subset of $\mathbb{R}^n$ having nonvoid interior, we may define the uniform
probability $\mathbb{P}_n$ on $M_n$. $M_n$ is a very "small" set. Indeed, its volume has
order $2^{-n^2}$ for large $n$. Moreover, this set "concentrates" (in some sense)
on a single point. More precisely, Chang, Kemperman and Studden (1993)
have shown that for any fixed $k \in \mathbb{N}^*$ and under the probability $\mathbb{P}_n$, the $k$
first components of $c^{(n)} \in M_n$ converge in probability to the $k$ first moments
of the arcsine law. That is,



(1)    $$\lim_{n\to\infty} \mathbb{P}_n\Bigl(\max_{1 \le j \le k} |c_j - \bar c_j| > \varepsilon\Bigr) = 0, \qquad \varepsilon > 0,$$

where $c^{(n)} = (c_1, \ldots, c_n)^T$, $\bar c_j = c_j(\nu)$, $j \in \mathbb{N}^*$, and

(2)    $$\nu(dx) = \frac{dx}{\pi\sqrt{x(1-x)}}.$$

Hence, $M_n$ is concentrated around the $n$ first moments of the arcsine
law. Moreover, Chang, Kemperman and Studden (1993) have studied the
fluctuation limit law in (1). They have shown that the limit distribution of
the fluctuations is Gaussian:



(3)    $$\sqrt{n}\,\bigl(Z_n^{(k)} - \bar c^{(k)}\bigr) \xrightarrow{\ \mathcal{L}\ } \mathcal{N}_k(0, \Sigma_k),$$

where $Z_n^{(k)}$ denotes the random vector built with the $k$ first coordinates
of $Z_n = c^{(n)}$ (drawn randomly with distribution $\mathbb{P}_n$) and
$\bar c^{(k)} = (\bar c_1, \bar c_2, \ldots, \bar c_k)^T$ is the vector of the $k$ first moments of $\nu$.
The covariance matrix $\Sigma_k$ will be described in Section 2.5. The main result
of this paper is a functional large deviations principle for the sequence
$(Z_n^{(k)})$. This means that $Z_n^{(k)}$ "concentrates" exponentially fast. More
precisely, for any $c^{(n)} \in M_n$, there exists a unique measure $\sigma_n^+(c^{(n)})$
whose $n$ first moments are $c^{(n)}$ and which maximizes the $(n+1)$th moment
[see Section 2.3, Karlin and Studden (1966), Krein and Nudelman (1977) and
Dette and Studden (1997)]. In Section 2.4 we show that $\sigma_n^+(Z_n)$ satisfies
a large deviations principle (LDP). Roughly speaking, this means that
$\sigma_n^+(Z_n)$ concentrates exponentially fast on $\nu$. In other words, for large $n$,



(4)    $$\mathbb{P}\bigl(\sigma_n^+(Z_n) \in A\bigr) \approx \exp\Bigl(-n \inf_{\mu \in A} I(\mu)\Bigr),$$

where $A$ is a Borel-measurable set of $P([0,1])$ and $I$ is the rate function
of the LDP (see Sections 2.1 and 2.4 for the precise statement). Although
we prove this main result using a projective limit approach [see
Dembo and Zeitouni (1998)], we manage to compute the rate function $I$
explicitly. This is done using the Szegő asymptotic theory of orthogonal
polynomials [Grenander and Szegő (1958)]. Surprisingly, $I$ is the reversed
Kullback information (or cross entropy) with respect to $\nu$. Moreover, this
computation gives quantitative evaluations of the $(n+1)$th moment range
size of the set of all probability measures having the same prescribed $n$ first
moments in terms of the reversed Kullback information. To our knowledge
these results on the power moment problem are new; they are developed in
Section 3.

In Section 2.5, we give large and moderate deviation principles for the
random vector $Z_n^{(k)}$. Hence, we study exponential rates of convergence, between
(3) and (4), for this vector. All the proofs are postponed to Section 4. This
last section begins with a subsection on canonical moments [Dette and Studden
(1997)], which are the main tool to prove our results.

2. Main results. 

2.1. Large deviations. Let us first recall what a LDP is [see, e.g., Dembo and Zeitouni
(1998)]. Let $(u_n)$ be a decreasing positive sequence of real numbers.

Definition 2.1. We say that a sequence $(R_n)$ of probability measures
on a measurable Hausdorff space $(U, \mathcal{B}(U))$ satisfies a LDP with rate function
$I$ and speed $(u_n)$ if:

(i) $I$ is lower semicontinuous (l.s.c.), with values in $\mathbb{R}_+ \cup \{+\infty\}$.

(ii) For any measurable set $A$ of $U$,

$$-I(\operatorname{int} A) \le \liminf_{n\to\infty} u_n \log R_n(A) \le \limsup_{n\to\infty} u_n \log R_n(A) \le -I(\operatorname{clo} A),$$

where $I(A) = \inf_{\xi \in A} I(\xi)$ and $\operatorname{int} A$ (resp. $\operatorname{clo} A$) is the interior (resp. the
closure) of $A$.

We say that the rate function $I$ is good if its level set $\{x \in U : I(x) \le a\}$
is compact for any $a > 0$. More generally, a sequence of $U$-valued random
variables is said to satisfy a LDP if their distributions satisfy a LDP.

To be self-contained let us recall some facts and tools on large deviations 
which will be useful in the paper [we refer to Dembo and Zeitouni (1998) 
for more on large deviations]: 

• Contraction principle. Assume that $(R_n)$ satisfies a LDP on $(U, \mathcal{B}(U))$ with
good rate function $I$ and speed $(u_n)$. Let $T$ be a continuous mapping from
$U$ to another space $V$. Then, $(R_n \circ T^{-1})$ satisfies a LDP on $(V, \mathcal{B}(V))$ with
good rate function

$$I'(y) = \inf_{x : T(x) = y} I(x), \qquad y \in V,$$

and speed $(u_n)$.
• Exponential approximation. Assume that $U$ is a metric space and let $d$
denote the distance on $U$. Let $(X_n)$ be a $U$-valued random sequence
satisfying a LDP with good rate function $I$ and speed $(u_n)$. Let $(Y_n)$ be another
$U$-valued random sequence. If for any $\varepsilon > 0$,

$$\limsup_{n\to\infty} u_n \log \mathbb{P}(d(X_n, Y_n) > \varepsilon) = -\infty,$$

then $(Y_n)$ shares the same LDP as $(X_n)$.

In the sequel, when talking about large deviations, if we omit the sequence $(u_n)$
it means that $u_n = 1/n$ ($n \in \mathbb{N}^*$). Following Section 3.7 of Dembo and Zeitouni
(1998), when the sequence $(u_n)$ satisfies $n^{-1} = o(u_n)$ we will say moderate
deviations rather than large deviations with speed $(u_n)$.

2.2. Kullback and reversed Kullback information. Let $(U, \mathcal{B}(U))$ be a
measurable space and $P$ and $Q$ be p.m.s on $(U, \mathcal{B}(U))$. Recall that the
Kullback information or cross entropy of $P$ with respect to $Q$ is defined by

$$K(P, Q) = \begin{cases} \displaystyle\int_U \log\frac{dP}{dQ}\, dP, & \text{if } P \ll Q \text{ and } \log\dfrac{dP}{dQ} \in L^1(P), \\ +\infty, & \text{otherwise.} \end{cases}$$

Properties of $K$ as a function of $P$ may be found in Bretagnolle (1979). $K$ is
the rate function in Sanov's large deviations theorem [see Dembo and Zeitouni
(1998)]. The rate function involved in this paper is the reversed Kullback
information with respect to $\nu$, that is,

$$I(\mu) = K(\nu, \mu), \qquad \mu \in P([0,1]),$$

where $\nu$ is the arcsine law (2). Observe that $I$ is l.s.c. [Theorem 2.7 of
Borwein and Lewis (1993)]. Moreover, it is obviously a good rate function
[$P([0,1])$ is a compact set].
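
As an illustration of the definition, the reversed Kullback information of Lebesgue measure on $[0,1]$ can be computed in closed form. The worked computation below is a sketch added for orientation; the symbol $\lambda$ for Lebesgue measure is an assumption of this example and is not notation from the paper.

```latex
% Worked example (assumption: \lambda denotes Lebesgue measure on [0,1]).
% Since d\nu/dx = 1/(\pi\sqrt{x(1-x)}), we get
\begin{align*}
I(\lambda) = K(\nu,\lambda)
  &= \int_0^1 \ln\frac{d\nu}{dx}(x)\,d\nu(x)
   = -\ln\pi - \frac12\int_0^1 \ln\bigl(x(1-x)\bigr)\,d\nu(x) \\
  &= -\ln\pi + 2\ln 2 = \ln\frac{4}{\pi} \approx 0.2416,
\end{align*}
% using \int_0^1 \ln x\,d\nu(x) = \int_0^1 \ln(1-x)\,d\nu(x) = -2\ln 2,
% which follows from the substitution x = \sin^2\theta.
```

The value is strictly positive, as expected since $\lambda \neq \nu$.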

The following property may be found in Theorem 2.1 of Gamboa and Gassiat
(1997) and in Section 3 of Borwein and Lewis (1993):

Proposition 2.2. (i) For $\mu \in P([0,1])$,

$$I(\mu) = \sup_{f \in C[0,1]} \Bigl\{ \int_0^1 f(x)\, d\mu(x) + \int_0^1 \ln(1 - f(x))\, d\nu(x) \Bigr\},$$

where $C[0,1]$ is the set of all continuous functions on $[0,1]$. (Throughout
the paper we take the convention $\ln r = -\infty$ whenever $r \le 0$.)

(ii) Let $\Phi = (\Phi_1, \ldots, \Phi_k)^T \in (C[0,1])^k$ and for $c \in \mathbb{R}^k$,

$$\mathbb{S}_\Phi(c) = \Bigl\{ \mu \in P([0,1]) : \int_0^1 \Phi_j(x)\, d\mu(x) = c_j,\ j = 1, \ldots, k \Bigr\}.$$

Then,

$$\inf_{\mu \in \mathbb{S}_\Phi(c)} I(\mu) = \sup_{(\lambda_0, \lambda) \in \mathbb{R}^{k+1}} \Bigl\{ \lambda_0 + \langle \lambda, c \rangle + \int_0^1 \ln\bigl(1 - \lambda_0 - \langle \lambda, \Phi(x) \rangle\bigr)\, d\nu(x) \Bigr\},$$

where $\langle \cdot, \cdot \rangle$ denotes the usual scalar product on $\mathbb{R}^k$.

We will come back to the second point in Section 2.5. This property will
be helpful to show the results of Section 3.

For $x \in [0,1]$ and $k \in \mathbb{N}^*$, let $\phi_k(x) = (x, x^2, \ldots, x^k)^T$. For $c \in \mathbb{R}^k$ we set
$\mathbb{S}_k(c) = \mathbb{S}_{\phi_k}(c)$. In the next section we briefly recall some known facts on
$\mathbb{S}_k(c)$. In Section 3 we give new results on $\mathbb{S}_k(c)$.

2.3. Power moment problem. Let $M$ denote the set of all infinite moment
sequences, that is,

$$M = \{ c^{(\infty)}(\mu) = (c_j(\mu))_{j \in \mathbb{N}^*} : \mu \in P([0,1]) \}.$$

For $n \in \mathbb{N}^*$, let $\Pi_n : M \to M_n$ denote the natural projection map. So, we have
$M_n = \Pi_n(M)$.

Let us recall some useful facts on the power moment problem [see Karlin and Studden
(1966) and Krein and Nudelman (1977) for an exhaustive overview of this
problem]. Because $[0,1]$ is a compact interval, the relation between the
elements of $M$ and $P([0,1])$ is bijective. In general, for $c^{(n)} \in M_n$, there exists
an infinite number of probability measures $\mu$ such that $c^{(n)} = c^{(n)}(\mu)$. More
precisely, for any $n \in \mathbb{N}^*$ and $c^{(n)} \in \mathbb{R}^n$:

(i) $\#\mathbb{S}_n(c^{(n)}) = +\infty \iff c^{(n)} \in \operatorname{int} M_n$,

(ii) $\#\mathbb{S}_n(c^{(n)}) = 1 \iff c^{(n)} \in \partial M_n$ (the boundary of $M_n$),

(iii) $\#\mathbb{S}_n(c^{(n)}) = 0 \iff c^{(n)} \notin M_n$,

where $\#A$ is the number of elements lying in $A$. These results come from
the fact that $(\phi_n)$ is a Tchebycheff system [see Karlin and Studden (1966)
and Krein and Nudelman (1977)]. The elements of $\partial M_n$ satisfy an extremal
property. For $c^{(k)} \in M_k$, let $c^+, c^- : M_k \to \mathbb{R}$ be defined by

$$c^+(c^{(k)}) = \max\{ c \in \mathbb{R} : (c_1, c_2, \ldots, c_k, c)^T \in M_{k+1} \},$$

$$c^-(c^{(k)}) = \min\{ c \in \mathbb{R} : (c_1, c_2, \ldots, c_k, c)^T \in M_{k+1} \}.$$

Then, $c^{(k+1)} = ((c^{(k)})^T, c_{k+1})^T \in \partial M_{k+1}$ if, and only if, $c_{k+1} = c^+(c^{(k)})$ [or
$c^-(c^{(k)})$].
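
The lowest-order case makes these definitions concrete. The following worked example is a sketch for $k = 1$ only; it is not taken from the paper, but uses only the definitions of $c^+$ and $c^-$ given above.

```latex
% Worked example for k = 1 (illustration, not from the paper).
% For a p.m. \mu on [0,1] with first moment c_1, Jensen's inequality and
% x^2 \le x on [0,1] give c_1^2 \le c_2(\mu) \le c_1, so that
\begin{align*}
M_2 &= \{(c_1,c_2)^T : c_1^2 \le c_2 \le c_1,\ 0 \le c_1 \le 1\},\\
c^-(c_1) &= c_1^2 \quad\text{(attained by } \delta_{c_1}\text{)},\\
c^+(c_1) &= c_1 \quad\text{(attained by } (1-c_1)\delta_0 + c_1\delta_1\text{)},\\
c^+(c_1) - c^-(c_1) &= c_1(1-c_1).
\end{align*}
% The range of the second moment is largest, equal to 1/4, at c_1 = 1/2,
% which is the first moment of the arcsine law.
```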

Let $d^{(n)} \in M_n$. We will denote by $\sigma_n^+(d^{(n)})$ the measure $\mu$ such that
$c^{(n)}(\mu) = d^{(n)}$ and $c_{n+1}(\mu) = c^+(d^{(n)})$. This measure is the so-called upper
canonical representation of the finite moment sequence $d^{(n)}$ [see Karlin and Studden
(1966) and Krein and Nudelman (1977)]. In the next section we will study
large deviations properties of $\sigma_n^+(Z_n)$.

2.4. Large deviations for $(Z_n)$. Recall that $\mathbb{P}_n$ denotes the normalized
Lebesgue probability measure on $M_n$ and $Z_n$ denotes a random vector having
distribution $\mathbb{P}_n$. For $k \le n$, let $\Pi_k^n : M_n \to M_k$ be the projection map taking
the $k$ first coordinates of an element of $M_n$. Let $Z_n^{(k)}$ denote the random
vector $\Pi_k^n(Z_n)$ and $\mu_n$ be the random measure $\sigma_n^+(Z_n)$. Let $Q_n$ be the law
of $\mu_n$. Our main result is the large deviations principle for $(\mu_n)$.

Theorem 2.3. $(\mu_n)$ satisfies a LDP with convex good rate function $I$.

Corollary 2.4. $(\mu_n)$ converges in probability to $\nu$.

We can also study the LDP associated with the so-called tilted measures.
Changing the measures $Q_n$ on $P([0,1])$ slightly, we obtain a new limit law
for the sequence of random probability measures $\mu_n$. Namely, consider on
$P([0,1])$ the new probability measure $\tilde Q_n$ defined by

(5)    $$\tilde Q_n(B) = \frac{E_{Q_n}\bigl(\exp(nF(\mu_n))\, 1_B(\mu_n)\bigr)}{E_{Q_n} \exp(nF(\mu_n))},$$

where $B$ runs over the Borel sets of $P([0,1])$ and $F : P([0,1]) \to \mathbb{R}$ is the
continuous functional defined by

$$F(\mu) = \int_{[0,1]} f_0(x)\, d\mu(x),$$

where $f_0 \in C([0,1])$.

We have the following results.

Theorem 2.5. $(\tilde Q_n)_{n \in \mathbb{N}}$ satisfies a LDP on $P([0,1])$ with good rate
function

$$I_F(\mu) = I(\mu) - F(\mu) + K_F,$$

where $K_F := \sup_{\mu' \in P([0,1])} \{ F(\mu') - I(\mu') \}$.

Remark 2.1. Let $\tilde\mu_n$ denote a random measure of $P([0,1])$ having
distribution $\tilde Q_n$. The existence of a unique minimum point for $I_F$ implies the
convergence (in probability) of $\tilde\mu_n$ toward this minimum point. Under
certain conditions on $f_0$ we can characterize the minimum points of $I_F$. Let
$\tilde\mu \in P([0,1])$ be a minimum point of $I_F$. Let $\tilde\mu = \tilde\mu_a + \tilde\mu_s$ be the Lebesgue
decomposition of $\tilde\mu$ with respect to $\nu$ ($\tilde\mu_a \ll \nu$). Then, from Theorem 3.5 of
Borwein and Lewis (1993) there exists $\lambda^* \in \mathbb{R}$ with:

(i) $g_{\lambda^*} := \dfrac{1}{\lambda^* - f_0} = \dfrac{d\tilde\mu_a}{d\nu}$, $\nu$-a.s.,

(ii) $\operatorname{supp} \tilde\mu_s \subset \{ x \in [0,1] : f_0(x) = \lambda^* \}$.

We now give two particular cases where the sequence $(\tilde\mu_n)$ has a limit.
Let $\Lambda := \max_{x \in [0,1]} f_0(x)$ and $\chi_0 := \int_{[0,1]} (\Lambda - f_0)^{-1}\, d\nu$.

(i) Assume $\chi_0 > 1$; then there exists $\lambda^* > \Lambda$ such that

$$\int_{[0,1]} g_{\lambda^*}(x)\, d\nu(x) = \int_{[0,1]} \frac{d\nu(x)}{\lambda^* - f_0(x)} = 1$$

and $(\tilde\mu_n)$ converges in probability to $g_{\lambda^*}\nu$.

(ii) If $\chi_0 \le 1$ and $\{ x \in [0,1] : f_0(x) = \Lambda \}$ reduces to the singleton $\{x_0\}$, then
$(\tilde\mu_n)$ converges in probability to

$$g_\Lambda \nu + (1 - \chi_0)\,\delta_{x_0}.$$
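
Case (i) can be explored numerically. The sketch below is only an illustration under stated assumptions: it takes the hypothetical choice $f_0(x) = x$ (so $\Lambda = 1$ and $\chi_0 = +\infty > 1$), integrates against the arcsine law through the substitution $x = \sin^2\theta$, and solves the normalizing equation of case (i) by bisection. The function names and the bracketing interval are choices made here, not part of the paper.

```python
import math

def arcsine_integral(g, n_points=20000):
    """Approximate the integral of g against the arcsine law nu on [0, 1].

    With x = sin(theta)^2 the law nu becomes uniform on [0, pi/2], so a
    midpoint rule in theta avoids the endpoint singularities of the density.
    """
    h = (math.pi / 2) / n_points
    total = 0.0
    for i in range(n_points):
        theta = (i + 0.5) * h
        total += g(math.sin(theta) ** 2)
    return total / n_points

def solve_lambda_star(f0, big_lambda, upper=10.0, tol=1e-10):
    """Bisection for lambda* > Lambda with int dnu / (lambda* - f0) = 1.

    Assumes the total mass exceeds 1 just above Lambda and is below 1 at `upper`.
    """
    def mass(lam):
        return arcsine_integral(lambda x: 1.0 / (lam - f0(x)))

    lo, hi = big_lambda, upper
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mass(mid) > 1.0:
            lo = mid   # too much mass: lambda* is larger
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical example: f0(x) = x, so Lambda = max f0 = 1.
lam_star = solve_lambda_star(lambda x: x, big_lambda=1.0)
```

For this particular $f_0$ the integral has the closed form $1/\sqrt{\lambda(\lambda-1)}$, so $\lambda^*$ should be the golden ratio $(1+\sqrt5)/2$, which gives a quick sanity check of the numerical value.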

2.5. Large and moderate deviations for finite moments sequence.
Although we may obtain a LDP for $(Z_n^{(k)})$ using Theorem 2.3 and the
contraction principle, we will show the following theorem.

Theorem 2.6. $(Z_n^{(k)})$ satisfies a LDP with convex good rate function

(6)    $$I_k(c^{(k)}) = \begin{cases} -\ln\bigl(c^+(c^{(k)}) - c^-(c^{(k)})\bigr) - k \ln 4, & \text{if } c^{(k)} \in \operatorname{int} M_k, \\ +\infty, & \text{otherwise.} \end{cases}$$

Theorem 2.3 will be shown by first proving Theorem 2.6 and then using a
projective limit argument [see Section 4.6 in Dembo and Zeitouni (1998)].

Remark 2.2. $I_k$ achieves the value $0$ only at $\bar c^{(k)}$. Hence, using the
Borel-Cantelli lemma, this yields the almost sure convergence of $Z_n^{(k)}$ to $\bar c^{(k)}$.

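We now turn to moderate deviation properties of $(Z_n^{(k)})$. For $i, j \in \mathbb{N}^*$,
define

(7)    $$a_{ij} = \begin{cases} 2^{-2i+1} \dbinom{2i}{i-j}, & \text{if } 1 \le j \le i, \\ 0, & \text{if } j > i. \end{cases}$$

For $k \in \mathbb{N}^*$, let $A_k = (a_{ij})_{i,j=1}^k$ and set

$$\Sigma_k = \tfrac12 A_k A_k^T$$

and

$$J_k(x) = \tfrac12 x^T \Sigma_k^{-1} x, \qquad x \in \mathbb{R}^k.$$

Theorem 2.7. Let $(u_n)$ be a sequence decreasing to $0$ such that
$n^{-1} = o(u_n)$ and let $\tilde Z_n^{(k)} := \sqrt{n u_n}\,(Z_n^{(k)} - \bar c^{(k)})$. Then $(\tilde Z_n^{(k)})$ satisfies a moderate
deviations principle with good rate function $J_k$.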
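
The quantities entering Theorem 2.7 are easy to tabulate. The sketch below assumes the reading $a_{ij} = 2^{-2i+1}\binom{2i}{i-j}$ of (7) and $\Sigma_k = \tfrac12 A_k A_k^T$; under these assumptions $J_k(x) = \tfrac12 x^T \Sigma_k^{-1} x = |A_k^{-1}x|^2$, which can be evaluated by forward substitution because $A_k$ is lower triangular. The function names are illustrative.

```python
from math import comb

def a_coeff(i, j):
    """Entry a_ij of (7): 2^(-2i+1) * C(2i, i-j) for 1 <= j <= i, else 0."""
    return comb(2 * i, i - j) / 2 ** (2 * i - 1) if 1 <= j <= i else 0.0

def build_A(k):
    """Lower-triangular matrix A_k = (a_ij), i, j = 1..k, as a list of rows."""
    return [[a_coeff(i, j) for j in range(1, k + 1)] for i in range(1, k + 1)]

def sigma(k):
    """Covariance matrix Sigma_k = (1/2) A_k A_k^T (assumed form)."""
    A = build_A(k)
    return [[0.5 * sum(A[i][m] * A[j][m] for m in range(k)) for j in range(k)]
            for i in range(k)]

def rate_J(x):
    """J_k(x) = (1/2) x^T Sigma_k^{-1} x, computed as |A_k^{-1} x|^2."""
    k = len(x)
    A = build_A(k)
    y = []
    for i in range(k):
        # Forward substitution for the lower-triangular system A_k y = x.
        s = x[i] - sum(A[i][m] * y[m] for m in range(i))
        y.append(s / A[i][i])
    return sum(v * v for v in y)
```

For $k = 1$ this gives $J_1(x) = 4x^2$, in agreement with the rate function of Lemma 4.4, since the first moment coincides with the first canonical moment.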
3. On the $(n+1)$th moment range size of the probability measures having
the same $n$ first moments. In this section, for $c^{(k)} \in M_k$ ($k \in \mathbb{N}^*$), we
set

$$r_{k+1}(c^{(k)}) = \sup_{\mu_1, \mu_2 \in \mathbb{S}_k(c^{(k)})} \Bigl\{ \int_0^1 x^{k+1}\, d\mu_1(x) - \int_0^1 x^{k+1}\, d\mu_2(x) \Bigr\}.$$

Using the large deviations properties of the previous section and the
contraction principle, we obtain the following:

Theorem 3.1. Let $c^{(k)} \in M_k$. Then:
(i)

(8)    $$r_{k+1}(c^{(k)}) = \exp\Bigl\{ -\inf_{\mu \in \mathbb{S}_k(c^{(k)})} I(\mu) - k \ln 4 \Bigr\}.$$

(ii) Let $\mu \in P([0,1])$ and $c^{(k)} = c^{(k)}(\mu)$, then

$$\lim_{k\to\infty} \ln\bigl[4^k r_{k+1}(c^{(k)})\bigr] = -I(\mu).$$

Remark 3.1. (i) Using Proposition 2.2, (8) may be expressed as the
supremum of a concave function. Indeed, we have

$$r_{k+1}(c^{(k)}) = \exp\Bigl( -k \ln 4 - \sup_{(\lambda_0, \lambda) \in \mathbb{R}^{k+1}} H_k(\lambda_0, \lambda, c^{(k)}) \Bigr),$$

where for $(\lambda_0, \lambda) \in \mathbb{R}^{k+1}$,

$$H_k(\lambda_0, \lambda, c^{(k)}) = \lambda_0 + \langle \lambda, c^{(k)} \rangle + \int_0^1 \ln\bigl(1 - \lambda_0 - \langle \lambda, \phi_k(x) \rangle\bigr)\, d\nu(x).$$

So for all $(\lambda_0, \lambda) \in \mathbb{R}^{k+1}$,

$$r_{k+1}(c^{(k)}) \le \exp\bigl[ -k \ln 4 - H_k(\lambda_0, \lambda, c^{(k)}) \bigr].$$

This last inequality is helpful to study the superresolution rate in the power
moment problem. This will be done in a forthcoming paper of Gamboa
and Lozada. Superresolution occurs for $c^{(k)} \in \partial M_k$. In this case, as we saw
in Section 2.3, $\mathbb{S}_k(c^{(k)})$ reduces to a single p.m. The superresolution rate is the
concentration rate of the set $\mathbb{S}_k(c^{(k)} + \eta)$ when $\eta \in \mathbb{R}^k$ is a small perturbation
[see Gamboa and Gassiat (1996) and Doukhan and Gamboa (1996) for more
on this problem].

(ii) Let $P$ be a polynomial having degree $k \in \mathbb{N}^*$. Assume that $P$ is
positive on $[0,1]$ and satisfies the normalizing condition

$$\int_0^1 \frac{d\nu(x)}{P(x)} = 1.$$

Set, for $j \in \mathbb{N}^*$, $d_j = c_j(\nu/P)$ and, as usual, $d^{(j)} = (d_i)_{i=1,\ldots,j}$. Then, using
optimization results developed in Theorem 2.1 of Gamboa and Gassiat (1997)
or Theorem 4.1 of Borwein and Lewis (1993) we have, for $j \ge k$,

$$I\Bigl(\frac{\nu}{P}\Bigr) = \inf_{\mu \in \mathbb{S}_j(d^{(j)})} I(\mu)$$

(see also Section 2.5 for related results). Therefore, in this frame Theorem
3.1 gives

$$r_{j+1}(d^{(j)}) = \exp\Bigl\{ -j \ln 4 - \int_0^1 \ln P(x)\, d\nu(x) \Bigr\}, \qquad j \ge k.$$

(iii) Using the contraction principle and Theorem 2.6, we obtain
that the sequence $(r_{k+1}(Z_n^{(k)}))$ satisfies a LDP with good rate function

$$R_k(r) = \begin{cases} -\ln(4^k r), & \text{if } r \in\, ]0, 4^{-k}[, \\ +\infty, & \text{otherwise.} \end{cases}$$

4. Proofs. 



4.1. Canonical moments. This section is devoted to canonical moments,
which are the basic tool for our results [as for those obtained in Chang, Kemperman and Studden
(1993)]. We will present a few properties of canonical moments. An exhaustive
study of canonical moments and their applications can be found in the
very nice book of Dette and Studden (1997). Let $p_i$ denote the $i$th canonical
moment, defined on $\operatorname{int} M_i$ by

$$p_i(c^{(i)}) = \frac{c_i - c^-(c^{(i-1)})}{c^+(c^{(i-1)}) - c^-(c^{(i-1)})},$$

where $c^+$ and $c^-$ have been defined in Section 2.3. For $k \in \mathbb{N}^*$, we will denote
by $p^{(k)}$ the map from $\operatorname{int} M_k$ to $]0,1[^k$,

$$c^{(k)} \mapsto p^{(k)}(c^{(k)}) = \bigl(p_1(c^{(1)}), p_2(c^{(2)}), \ldots, p_k(c^{(k)})\bigr)^T.$$

The map $p^{(k)}$ is a continuous one-to-one onto correspondence between the
interior of the $k$th moment space and $]0,1[^k$ [see Skibinsky (1968) and
Dette and Studden (1997)]. Obviously, the sequence $(p^{(k)})$ induces a
one-to-one correspondence between $\operatorname{int} M$ and $]0,1[^{\mathbb{N}^*}$. A very interesting
property of the canonical moments is that they are invariant under linear
transformations of the interval. Therefore, as the reader will easily check,
all the results obtained on $[0,1]$ are also valid for any other bounded closed
interval of the real line. The canonical moments of the arcsine law $\nu$ are all
equal to $\tfrac12$ [see Skibinsky (1969)]. We will denote them by $\bar p_j$.
In Lemma 1.4 of Chang, Kemperman and Studden (1993) the first-order Taylor
expansion of the moments, as functions of the canonical moments, around $\bar p^{(k)}$ is given:

(9)    $$c_m = \bar c_m + 2 \sum_{j=1}^m a_{mj}\bigl(p_j - \tfrac12\bigr) + O\Bigl( \sum_{j=1}^m \bigl|p_j - \tfrac12\bigr|^2 \Bigr),$$

where, for $m, j \in \mathbb{N}^*$, $c^{(m)} = (c_1, \ldots, c_m)^T \in \operatorname{int} M_m$, $p^{(m)}(c^{(m)}) = (p_1, \ldots, p_m)^T \in\, ]0,1[^m$
and $a_{mj}$ has been defined in Section 2.5.

We recall that the beta distribution $\beta(a,b)$ of parameters $a, b > 0$ has
density with respect to the Lebesgue measure on $[0,1]$,

$$[B(a,b)]^{-1} x^{a-1} (1-x)^{b-1},$$

where $B(a,b) = \int_{[0,1]} x^{a-1} (1-x)^{b-1}\, dx$.
We have the following lemma.

Lemma 4.1 [Theorem 1.3 of Chang, Kemperman and Studden (1993)].
If $M_n$ is endowed with $\mathbb{P}_n$, then the random vector $p^{(n)} = (p_1, p_2, \ldots, p_n)^T$
satisfies:

(i) $(p_i)_{i=1}^n$ are independent random variables,

(ii) $p_i \sim \beta(n-i+1, n-i+1)$, $i = 1, \ldots, n$.
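
Lemma 4.1 gives a direct way to simulate a uniform point of $M_n$: draw independent beta canonical moments and map them back to ordinary moments. The sketch below is a minimal illustration. The inverse map is not spelled out in the text; the recursion used here ($\zeta_1 = p_1$, $\zeta_j = (1-p_{j-1})p_j$, $S_{0,j} = 1$, $S_{i,j} = \sum_{k=i}^{j} \zeta_{k-i+1} S_{i-1,k}$, $c_i = S_{i,i}$) is the standard one from the canonical moments literature and should be read as an assumption of this example, as should all function names.

```python
import random
from math import comb

def moments_from_canonical(p):
    """Map canonical moments (p_1, ..., p_n) to ordinary moments (c_1, ..., c_n).

    Assumed recursion: zeta_j = q_{j-1} p_j with q_0 = 1, S_{0,j} = 1,
    S_{i,j} = sum_{k=i}^{j} zeta_{k-i+1} S_{i-1,k}, and c_i = S_{i,i}.
    """
    n = len(p)
    q = [1.0] + [1.0 - x for x in p]                              # q[j] = 1 - p_j
    zeta = [0.0] + [q[j - 1] * p[j - 1] for j in range(1, n + 1)]  # 1-indexed
    s_prev = [1.0] * (n + 1)                                      # row S_{0,.}
    moments = []
    for i in range(1, n + 1):
        s_cur = [0.0] * (n + 1)
        acc = 0.0
        for j in range(i, n + 1):
            acc += zeta[j - i + 1] * s_prev[j]                    # running sum over k
            s_cur[j] = acc
        moments.append(s_cur[i])
        s_prev = s_cur
    return moments

def sample_uniform_moment_point(n, rng):
    """Draw Z_n uniformly on M_n using Lemma 4.1: p_i ~ beta(n-i+1, n-i+1)."""
    p = [rng.betavariate(n - i + 1, n - i + 1) for i in range(1, n + 1)]
    return moments_from_canonical(p)

def arcsine_moment(k):
    """k-th moment of the arcsine law on [0, 1]: C(2k, k) / 4^k."""
    return comb(2 * k, k) / 4 ** k
```

With all canonical moments equal to $\tfrac12$ the recursion returns the arcsine moments $\tfrac12, \tfrac38, \tfrac5{16}, \ldots$, and for moderately large $n$ the first few coordinates of a sample from `sample_uniform_moment_point` cluster around these values, which is the concentration phenomenon described by (1).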

Now, we display an equation linking canonical and ordinary power moments
given in Skibinsky (1967): for $c^{(n)} \in \operatorname{int} M_n$,

(10)    $$r_{n+1}(c^{(n)}) = c^+(c^{(n)}) - c^-(c^{(n)}) = \prod_{i=1}^n p_i(c^{(i)})\bigl(1 - p_i(c^{(i)})\bigr).$$

In (15) we will express $r_{n+1}(c^{(n)})$ as a function of Hankel determinants.
4.2. Proof of Theorem 2.6. 

Lemma 4.2 [Exercise 4.2.7 of Dembo and Zeitouni (1998)]. Let $(X_{n,1})$
[resp. $(X_{n,2})$] be a sequence of random variables taking its values in a regular
space $\mathcal{X}_1$ (resp. $\mathcal{X}_2$) satisfying a LDP with good rate function $I_1$ (resp. $I_2$).
If $X_{n,1}$ and $X_{n,2}$ are independent, then $X_n = (X_{n,1}, X_{n,2})$ satisfies a LDP in
$\mathcal{X}_1 \times \mathcal{X}_2$ with good rate function $I(x_1, x_2) = I_1(x_1) + I_2(x_2)$.

Lemma 4.3. The sequence of p.m.s $(\beta(n,n))_{n \in \mathbb{N}}$ satisfies on $[0,1]$ a LDP
with good rate function

$$I(x) = -\ln(x - x^2) - \ln 4.$$

Proof. Let $X_n$ be a random variable having $\beta(n,n)$ distribution; then
we may write

$$X_n \overset{(d)}{=} \frac{\sum_{i=1}^{n} Y_i}{\sum_{i=1}^{2n} Y_i},$$

where the random variables $(Y_i)$ are independent with standard exponential
distribution [see, e.g., Bartoli and Del Moral (2001), page 71]. Consider the
bidimensional random vector

$$V_n = \Bigl( \frac1n \sum_{i=1}^{n} Y_i,\ \frac1n \sum_{i=n+1}^{2n} Y_i \Bigr).$$

From Lemma 4.2 and Cramér's theorem [Theorem 2.2.3 in Dembo and Zeitouni
(1998)], we obtain a LDP for $(V_n)$ with good rate function

$$I_V(x, y) = x + y - 2 - \log(xy), \qquad x > 0,\ y > 0.$$

Hence, the LDP for $(X_n)$ follows from the one for $(V_n)$ and the contraction
principle with the continuous map $(x, y) \mapsto x/(x+y)$. □
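
The rate function of Lemma 4.3 can be checked heuristically against the $\beta(n,n)$ density itself: for fixed $x$, $-\tfrac1n$ times the log-density should approach $-\ln(x - x^2) - \ln 4$. The sketch below is such a pointwise numerical check, not a proof; the function names are illustrative, and `math.lgamma` is used to evaluate $\ln B(n,n) = 2\ln\Gamma(n) - \ln\Gamma(2n)$ stably.

```python
import math

def beta_nn_log_density(n, x):
    """Log-density of beta(n, n) at x in ]0, 1[."""
    log_B = 2.0 * math.lgamma(n) - math.lgamma(2 * n)   # ln B(n, n)
    return (n - 1) * math.log(x * (1.0 - x)) - log_B

def rate_beta(x):
    """Rate function of Lemma 4.3: I(x) = -ln(x - x^2) - ln 4."""
    return -math.log(x - x * x) - math.log(4.0)

def empirical_rate(n, x):
    """Finite-n proxy for the rate: -(1/n) * log-density of beta(n, n) at x."""
    return -beta_nn_log_density(n, x) / n
```

For instance, `empirical_rate(20000, 0.3)` and `rate_beta(0.3)` should agree to about three decimal places, and `rate_beta(0.5)` vanishes, consistent with the concentration of $\beta(n,n)$ at $\tfrac12$.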

From Lemmas 4.3 and 4.1, we have that, for fixed $j \in \mathbb{N}^*$, $(p_j(Z_n))_{n \in \mathbb{N}}$
satisfies a LDP with good rate function $I$. Thanks to Lemma 4.2, for
$k \in \mathbb{N}^*$, we also have a LDP for $(p^{(k)}(Z_n^{(k)}))_{n \in \mathbb{N}}$ with good rate function

$$\tilde I_k(p^{(k)}) = -\sum_{i=1}^k \ln(p_i - p_i^2) - k \ln 4,$$

where $p^{(k)} = (p_1, p_2, \ldots, p_k)^T \in\, ]0,1[^k$.

We have $Z_n^{(k)} = (p^{(k)})^{-1}(p^{(k)}(Z_n^{(k)}))$. The contraction principle yields the LDP
for $(Z_n^{(k)})_{n \in \mathbb{N}}$ with good rate function

$$I_k(c^{(k)}) = \tilde I_k(p^{(k)}(c^{(k)})).$$

Using (10), we may write $I_k$ as in (6).

It is obvious that the function $\tilde I_k$ achieves its minimum value at $\bar p^{(k)}$
(and only at this point). Indeed, remember that $\bar p_j = \tfrac12$ for $1 \le j \le k$.
Consequently, the function $I_k$ achieves its minimum value at $\bar c^{(k)}$.

4.3. Proof of Theorem 2.7. 

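Lemma 4.4. Let $(u_n)$ be a sequence decreasing to $0$ with $n^{-1} = o(u_n)$. Let
$Y_n$ be a random variable having $\beta(n-l, n-l)$ distribution ($n \in \mathbb{N}^*$, $l \in \mathbb{N}$).
Set

$$X_n := \sqrt{n u_n}\,\bigl(Y_n - \tfrac12\bigr),$$

then $(X_n)$ satisfies a LDP with good rate function $J_1(x) = 4x^2$ and speed
$(u_n)$.

Proof. Set $B_n := B(n,n)$. It is well known that $B_n = ((n-1)!)^2/(2n-1)!$. Using Stirling's
formula for $n!$, we have

(11)    $$B_n = \Bigl(\frac{\pi}{n}\Bigr)^{1/2} 4^{-n} (2 + \varepsilon_n),$$

with $\varepsilon_n \to 0$ as $n \to \infty$. Therefore, $\lim_n \frac1n \ln B_n = -\ln 4$.

Let $\mathcal{A} := \{ ]a,b[\, : a < b \}$. $\mathcal{A}$ is a base of the usual topology on $\mathbb{R}$. Let $]a,b[ \in \mathcal{A}$.
For $n$ large enough,

$$\mathbb{P}(X_n \in\, ]a,b[) = \mathbb{P}\Bigl( Y_n \in \Bigl] \tfrac12 + \frac{a}{\sqrt{n u_n}},\ \tfrac12 + \frac{b}{\sqrt{n u_n}} \Bigr[ \Bigr) = \frac{1}{B_{n-l}} \int_{1/2 + a/\sqrt{n u_n}}^{1/2 + b/\sqrt{n u_n}} x^{n-l-1} (1-x)^{n-l-1}\, dx$$

$$\le \frac{1}{B_{n-l}} \Bigl[ m^+\Bigl( \tfrac12 + \frac{a}{\sqrt{n u_n}},\ \tfrac12 + \frac{b}{\sqrt{n u_n}} \Bigr) \Bigr]^{n-l-1} \frac{b-a}{\sqrt{n u_n}},$$

where $m^+(a,b) := \sup\{ t - t^2 : t \in\, ]a,b[ \}$ and $m^-(a,b) := \inf\{ t - t^2 : t \in\, ]a,b[ \}$.
The function $t \mapsto t^2 - t$ is strictly convex in $]0,1[$ with minimum value at $\tfrac12$,
so

$$m^+\Bigl( \tfrac12 + \frac{a}{\sqrt{n u_n}},\ \tfrac12 + \frac{b}{\sqrt{n u_n}} \Bigr) = \begin{cases} \dfrac14 - \dfrac{a^2}{n u_n}, & \text{if } a > 0, \\[1ex] \dfrac14 - \dfrac{b^2}{n u_n}, & \text{if } b < 0, \\[1ex] \dfrac14, & \text{if } a \le 0 \le b. \end{cases}$$

Using (11),

$$\limsup_{n} u_n \ln \mathbb{P}(X_n \in\, ]a,b[) \le \begin{cases} -4a^2, & \text{if } a > 0, \\ -4b^2, & \text{if } b < 0, \\ 0, & \text{if } a \le 0 \le b. \end{cases}$$

This leads to

(12)    $$\sup_{\{A \in \mathcal{A} : x \in A\}} \Bigl\{ -\limsup_{n} u_n \ln \mathbb{P}(X_n \in A) \Bigr\} \ge 4x^2.$$

Now, analogously, we have

$$\mathbb{P}(X_n \in\, ]a,b[) \ge \Bigl[ m^-\Bigl( \tfrac12 + \frac{a}{\sqrt{n u_n}},\ \tfrac12 + \frac{b}{\sqrt{n u_n}} \Bigr) \Bigr]^{n-l-1} \frac{b-a}{\sqrt{n u_n}}\, \frac{1}{B_{n-l}}.$$

But,

$$m^-\Bigl( \tfrac12 + \frac{a}{\sqrt{n u_n}},\ \tfrac12 + \frac{b}{\sqrt{n u_n}} \Bigr) = \begin{cases} \dfrac14 - \dfrac{b^2}{n u_n}, & \text{if } a > 0, \\[1ex] \dfrac14 - \dfrac{a^2}{n u_n}, & \text{if } b < 0, \\[1ex] \dfrac14 - \dfrac{\max\{a^2, b^2\}}{n u_n}, & \text{if } a \le 0 \le b. \end{cases}$$

So, using (11), we obtain

$$\liminf_{n} u_n \ln \mathbb{P}(X_n \in\, ]a,b[) \ge \begin{cases} -4b^2, & \text{if } a > 0, \\ -4a^2, & \text{if } b < 0, \\ -4\max\{a^2, b^2\}, & \text{if } a \le 0 \le b. \end{cases}$$

Consequently,

(13)    $$\sup_{\{A \in \mathcal{A} : x \in A\}} \Bigl\{ -\liminf_{n} u_n \ln \mathbb{P}(X_n \in A) \Bigr\} \le 4x^2.$$

Using Theorem 4.1.11 of Dembo and Zeitouni (1998), (12) and (13), we get a
weak LDP for $X_n$ with rate function $J_1$. The full LDP is then a consequence
of the exponential tightness of the laws of $X_n$. Indeed, for $K > 0$ and $n$ large
enough, we have

$$\mathbb{P}(X_n \in [-K,K]^c) = 2\,\mathbb{P}\Bigl( Y_n \in \Bigl] \tfrac12 + \frac{K}{\sqrt{n u_n}},\ 1 \Bigr] \Bigr) = \frac{2}{B_{n-l}} \int_{1/2 + K/\sqrt{n u_n}}^{1} x^{n-l-1} (1-x)^{n-l-1}\, dx$$

$$\le B_{n-l}^{-1} \Bigl( \frac14 - \frac{K^2}{n u_n} \Bigr)^{n-l-1} \Bigl( 1 - \frac{2K}{\sqrt{n u_n}} \Bigr).$$

Thus, using (11) and the expansion

$$\ln\Bigl( \frac14 - \frac{K^2}{n u_n} \Bigr) = -\ln 4 - \frac{4K^2}{n u_n} + o\Bigl( \frac{1}{n u_n} \Bigr),$$

we obtain

$$\limsup_{n} u_n \ln \mathbb{P}(X_n \in [-K,K]^c) \le -4K^2,$$

which implies the exponential tightness. □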

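Lemma 4.5. For every sequence $(u_n)$ decreasing to $0$ with $n^{-1} = o(u_n)$
and $k \in \mathbb{N}^*$, the sequence of random vectors $Z'_n := 2\sqrt{n u_n}\, A_k \bigl( p^{(k)}(Z_n^{(k)}) - \bar p^{(k)} \bigr)$
satisfies a LDP with good rate function $J_k$ and speed $(u_n)$.

Proof. Using the two previous lemmas, we can state a LDP for the
random vectors $W_n := \sqrt{n u_n}\, \bigl( p^{(k)}(Z_n^{(k)}) - \bar p^{(k)} \bigr)$ with good rate function

$$J'_k(x) = 4 x^T x.$$

So, using the contraction principle, we obtain a LDP for $Z'_n = 2 A_k W_n$ with
good rate function

$$J_k(x) = \inf\{ J'_k(y) : x = 2 A_k y \} = x^T (A_k^{-1})^T A_k^{-1} x = \tfrac12 x^T \Sigma_k^{-1} x. \qquad \Box$$

Lemma 4.6. The random vector sequences $(\tilde Z_n^{(k)})$ and $(Z'_n)$ are
exponentially equivalent.

Proof. It is obvious that there exists a constant $C_0$ such that

$$\bigl| O\bigl( |p^{(k)} - \bar p^{(k)}|^2 \bigr) \bigr| \le C_0\, |p^{(k)} - \bar p^{(k)}|^2,$$

where $O(|p^{(k)} - \bar p^{(k)}|^2)$ is the function which appears in (9). So,

$$\{ |\tilde Z_n^{(k)} - Z'_n| > \varepsilon \} = \bigl\{ \sqrt{n u_n}\, \bigl| O\bigl( |p^{(k)} - \bar p^{(k)}|^2 \bigr) \bigr| > \varepsilon \bigr\} \subset \Bigl\{ \sqrt{n u_n}\, |p^{(k)} - \bar p^{(k)}|^2 > \frac{\varepsilon}{C_0} \Bigr\}.$$

Let $C_1 > 0$. For $\sqrt{n u_n} > C_1$,

$$\Bigl\{ \sqrt{n u_n}\, |p^{(k)} - \bar p^{(k)}|^2 > \frac{\varepsilon}{C_0} \Bigr\} \subset \Bigl\{ \sqrt{n u_n}\, |p^{(k)} - \bar p^{(k)}| > \Bigl( \frac{\varepsilon C_1}{C_0} \Bigr)^{1/2} \Bigr\}.$$

This leads to

$$\limsup_{n} u_n \ln \mathbb{P}(|\tilde Z_n^{(k)} - Z'_n| > \varepsilon) \le \limsup_{n} u_n \ln \mathbb{P}\Bigl( \sqrt{n u_n}\, |p^{(k)} - \bar p^{(k)}| > \Bigl( \frac{\varepsilon C_1}{C_0} \Bigr)^{1/2} \Bigr) \le -4\, \frac{\varepsilon C_1}{C_0},$$

where the last inequality follows from the LDP for the random vectors $W_n$
(Lemma 4.5). Therefore, taking $C_1 \to \infty$, we obtain

$$\limsup_{n} u_n \ln \mathbb{P}(|\tilde Z_n^{(k)} - Z'_n| > \varepsilon) = -\infty.$$

Then $(\tilde Z_n^{(k)})$ and $(Z'_n)$ are exponentially equivalent. □

Theorem 2.7 is obtained directly using the two previous lemmas.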



4.4. Proof of Theorem 2.3.

4.4.1. Statement of the LDP. Consider on $M$ the product topology and
the product $\sigma$-algebra. Let $M_n^+$ be the subset of $M$ defined by

$$M_n^+ = \{ c^{(\infty)} \in M : c_{n+1} = c^+(c^{(n)}) \}.$$

We define on $M$ the following sequence of p.m.s $(\tilde P_n)_{n \in \mathbb{N}}$:

$$\tilde P_n(B) = \mathbb{P}_n(\Pi_n(B \cap M_n^+)), \qquad B \text{ measurable set of } M.$$

Theorem 4.7. $(\tilde P_n)_{n \in \mathbb{N}}$ satisfies a LDP with rate function

$$I_M(c^{(\infty)}) = \lim_{n} -\ln\bigl( 4^n ( c^+(c^{(n)}) - c^-(c^{(n)}) ) \bigr).$$

Proof. Let $k \in \mathbb{N}^*$ and $B_k$ be a measurable set of $\mathbb{R}^k$. We have, for
$n > k$,

$$\tilde P_n \circ \Pi_k^{-1}(B_k) = \tilde P_n(B_k \times \mathbb{R}^{\mathbb{N}}) = \mathbb{P}_n\bigl( \Pi_n( (B_k \times \mathbb{R}^{\mathbb{N}}) \cap M_n^+ ) \bigr) = \mathbb{P}_n(B_k \times \mathbb{R}^{n-k}) = \mathbb{P}(Z_n^{(k)} \in B_k).$$

Therefore, the family $\{ \tilde P_n \circ \Pi_k^{-1} \}_{n \in \mathbb{N}}$ satisfies a LDP with rate function
$I_k$. The Dawson-Gärtner theorem [Theorem 4.6.1 of Dembo and Zeitouni
(1998)] leads to a LDP for $\tilde P_n$ on the projective limit of the spaces $M_n$, which
is $M$. The good rate function is $I_M(c^{(\infty)}) = \sup_n I_n(\Pi_n c^{(\infty)})$. For $c^{(\infty)} \in M$,
$I_n(\Pi_n c^{(\infty)})$ is nondecreasing in $n$, thus $\sup_n I_n = \lim_n I_n$ [see (10)]. □

Let $G$ denote the one-to-one correspondence between $M$ and $P([0,1])$. It is
easy to see that it is continuous. Therefore, using the contraction principle,
we obtain a LDP for the probability measure family $\{ \tilde P_n \circ G^{-1} \}_{n \in \mathbb{N}}$. But,
for $B$ a Borel set of $P([0,1])$,

$$Q_n(B) = (\tilde P_n \circ G^{-1})(B) = \tilde P_n(G^{-1}(B)).$$

Indeed, we have $c^{(\infty)} \in G^{-1}(B) \cap M_n^+$ if, and only if, $G(c^{(\infty)}) = \sigma_n^+(c^{(n)}) \in B$. So,

$$(\tilde P_n \circ G^{-1})(B) = \mathbb{P}\bigl( Z_n \in \Pi_n( G^{-1}(B) \cap M_n^+ ) \bigr) = \mathbb{P}(\sigma_n^+(Z_n) \in B).$$

Therefore, the contraction principle gives the LDP for $(Q_n)_{n \in \mathbb{N}}$ with good
rate function

(14)    $$I(\mu) = \lim_{n} -\ln\bigl[ 4^{n-1} ( c_n^+(\mu) - c_n^-(\mu) ) \bigr],$$

where $c_n^+(\mu)$ and $c_n^-(\mu)$ denote $c^+(c^{(n-1)}(\mu))$ and $c^-(c^{(n-1)}(\mu))$, respectively.
From (10), we may conclude that $I$ achieves the value $0$ only at
$\nu$.
Lemma 4.8. The function $I$ in (14) is convex.

Proof. Let $\mu = \alpha\mu_1 + \beta\mu_2$ with $\mu, \mu_1, \mu_2 \in P([0,1])$ and $\alpha, \beta$ positive
real numbers such that $\alpha + \beta = 1$. Thus,

$$c^{(n)}(\mu) = \alpha c^{(n)}(\mu_1) + \beta c^{(n)}(\mu_2).$$

Thanks to the convexity of $M_{n+1}$,

$$\alpha c^+(c^{(n)}(\mu_1)) + \beta c^+(c^{(n)}(\mu_2)) \in \{ c : (c^{(n)}(\mu), c) \in M_{n+1} \}$$

and by the definition of $c^+$, we have

$$c^+(c^{(n)}(\mu)) \ge \alpha c^+(c^{(n)}(\mu_1)) + \beta c^+(c^{(n)}(\mu_2)).$$

Analogously, we obtain that $c^-(c^{(n)}(\mu)) \le \alpha c^-(c^{(n)}(\mu_1)) + \beta c^-(c^{(n)}(\mu_2))$.
Thus,

$$c_n^+(\mu) - c_n^-(\mu) \ge \alpha\bigl( c_n^+(\mu_1) - c_n^-(\mu_1) \bigr) + \beta\bigl( c_n^+(\mu_2) - c_n^-(\mu_2) \bigr).$$

Since $t \mapsto -\ln(4^n t)$ is a nonincreasing convex function in $t > 0$ for all $n \in \mathbb{N}^*$,
we obtain that $I$ is convex as the supremum of convex functions. □

4.4.2. Identification of the rate function. From Corollary 1.4.6 of Dette and Studden
(1997), we may write the following equation:

(15)    $$r_n(c^{(n-1)}) = \frac{\underline{H}_{n-1}\, \overline{H}_{n-1}}{\underline{H}_{n-2}\, \overline{H}_{n-2}}, \qquad (c_1, \ldots, c_{n-1}) \in M_{n-1},$$

where, for $c^{(2m+1)} \in M_{2m+1}$ ($m \in \mathbb{N}^*$) and with $c_0 = 1$,

$$\underline{H}_{2m} = \det\bigl( c_{i+j} \bigr)_{i,j=0}^{m}, \qquad \overline{H}_{2m} = \det\bigl( c_{i+j-1} - c_{i+j} \bigr)_{i,j=1}^{m},$$

$$\underline{H}_{2m+1} = \det\bigl( c_{i+j+1} \bigr)_{i,j=0}^{m}, \qquad \overline{H}_{2m+1} = \det\bigl( c_{i+j} - c_{i+j+1} \bigr)_{i,j=0}^{m}.$$

These determinants are called Hankel determinants. Within the moment
problem they play an important role. The conditions under which a
sequence is a moment sequence can be expressed in terms of these Hankel
determinants [see Karlin and Studden (1966), Krein and Nudelman (1977) and
Dette and Studden (1997)].
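
Formula (15) can be exercised on small cases with exact rational arithmetic. The sketch below assumes the reading $r_n(c^{(n-1)}) = \underline{H}_{n-1}\overline{H}_{n-1} / (\underline{H}_{n-2}\overline{H}_{n-2})$ with $c_0 = 1$ and the convention that an empty determinant equals 1; the function names are illustrative. For the arcsine moments $\binom{2k}{k}/4^k$ all canonical moments equal $\tfrac12$, so by (10) the range size should be exactly $4^{-(n-1)}$. For Lebesgue measure (moments $1/(k+1)$, an example chosen here), $-\ln(4^k r_{k+1})$ should increase toward $\ln(4/\pi)$, the reversed Kullback information of Lebesgue measure with respect to $\nu$.

```python
from fractions import Fraction
from math import comb, log, pi

def det(matrix):
    """Exact determinant by Gaussian elimination; the empty matrix has det 1."""
    a = [row[:] for row in matrix]
    n = len(a)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for k in range(col, n):
                a[r][k] -= factor * a[col][k]
    return result

def hankel_lower(n, c):
    """Lower Hankel determinant of order n; c[0] = 1 and c[j] is the j-th moment."""
    m, shift = n // 2, n % 2
    return det([[c[i + j + shift] for j in range(m + 1)] for i in range(m + 1)])

def hankel_upper(n, c):
    """Upper Hankel determinant of order n (differences of consecutive moments)."""
    m = n // 2
    if n % 2 == 0:
        return det([[c[i + j - 1] - c[i + j] for j in range(1, m + 1)]
                    for i in range(1, m + 1)])
    return det([[c[i + j] - c[i + j + 1] for j in range(m + 1)]
                for i in range(m + 1)])

def range_size(n, c):
    """r_n(c^(n-1)) = c^+ - c^- via (15); needs c[0..n-1] and n >= 2."""
    num = hankel_lower(n - 1, c) * hankel_upper(n - 1, c)
    den = hankel_lower(n - 2, c) * hankel_upper(n - 2, c)
    return num / den

def scaled_log_range(k, c):
    """-ln(4^k r_{k+1}(c^(k))), the quantity whose limit in k appears in (14)."""
    return -log(float(4 ** k * range_size(k + 1, c)))

arcsine = [Fraction(comb(2 * k, k), 4 ** k) for k in range(7)]
lebesgue = [Fraction(1, k + 1) for k in range(21)]
```

Using `Fraction` avoids the severe ill-conditioning of Hankel matrices in floating point; the price is that the exact computation becomes slow for large orders.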
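We have the following relations between Hankel determinants. Let $\mu \in P([0,1])$.
Let $\mu' \in P([0,1])$ be defined by

$$\mu'(dx) = \frac{x - x^2}{c_1(\mu) - c_2(\mu)}\, \mu(dx).$$

Obviously, $\mu'$ verifies

$$c_k(\mu') = \frac{c_{k+1}(\mu) - c_{k+2}(\mu)}{c_1(\mu) - c_2(\mu)}.$$

Hence,

$$\overline{H}_{2m}(\mu) = (c_1(\mu) - c_2(\mu))^m\, \underline{H}_{2(m-1)}(\mu').$$

Moreover,

$$\frac{\overline{H}_{2m}(\mu)}{\overline{H}_{2(m-1)}(\mu)} = (c_1(\mu) - c_2(\mu))\, \frac{\underline{H}_{2(m-1)}(\mu')}{\underline{H}_{2(m-2)}(\mu')}.$$

In what follows we set $c^{(n)} = c^{(n)}(\mu)$, $c_n^+ = c^+(c^{(n-1)})$ and $c_n^- = c^-(c^{(n-1)})$.
The Hankel determinants depending on $\mu'$ are tagged with a prime. We have

$$(c_{n+1}^+ - c_{n+1}^-)(c_n^+ - c_n^-) = \frac{\underline{H}_n\, \overline{H}_n}{\underline{H}_{n-2}\, \overline{H}_{n-2}} = (c_1 - c_2)\, \frac{\underline{H}_n}{\underline{H}_{n-2}}\, \frac{\underline{H}'_{n-2}}{\underline{H}'_{n-4}},$$

where the last equality is only true if $n$ is even.

As the limit in (14) exists, we can calculate it taking $n$ even:

$$\lim_n \ln\bigl( 4^{n-1}(c_n^+ - c_n^-) \bigr) = \tfrac12 \lim_m \ln\bigl( 4^{2m}(c_{2m+1}^+ - c_{2m+1}^-) \bigr) + \tfrac12 \lim_m \ln\bigl( 4^{2m-1}(c_{2m}^+ - c_{2m}^-) \bigr)$$

$$= \tfrac12 \lim_m \ln\bigl( 4^{4m-1}(c_{2m+1}^+ - c_{2m+1}^-)(c_{2m}^+ - c_{2m}^-) \bigr)$$

$$= \tfrac12 \lim_m \ln\Bigl( 4^{4m-1}(c_1 - c_2)\, \frac{\underline{H}_{2m}}{\underline{H}_{2(m-1)}}\, \frac{\underline{H}'_{2(m-1)}}{\underline{H}'_{2(m-2)}} \Bigr).$$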

Grenander and Szegő (1958) have shown a general limit theorem for the
quotient of Hankel determinants (see the theorem in Section 5.2 and Section
6.3 of their book). In our framework it can be written in the following way.

Theorem 4.9. Let $\mu \in P([0,1])$. Call $f(x)$ its Radon-Nikodym derivative
with respect to the Lebesgue measure, then

$$\lim_{n} 4^{2n+1}\, \frac{\underline{H}_{2n}(\mu)}{\underline{H}_{2n-2}(\mu)} = 2\pi \exp\Bigl( \int_0^1 \ln f(x)\, d\nu(x) \Bigr).$$

Now we may write

$$\lim_m \ln\bigl( 4^{2m-1}(c_{2m}^+ - c_{2m}^-) \bigr) = \tfrac12 \lim_m \ln\Bigl( 4^{2m+1}\, \frac{\underline{H}_{2m}}{\underline{H}_{2(m-1)}} \Bigr) + \tfrac12 \lim_m \ln\Bigl( 4^{2m-1}\, \frac{\underline{H}'_{2(m-1)}}{\underline{H}'_{2(m-2)}} \Bigr) + \tfrac12 \lim_m \ln\bigl( 4^{-1}(c_1 - c_2) \bigr)$$

$$= \tfrac12 \int_0^1 \ln f(x)\, d\nu(x) + \tfrac12 \int_0^1 \ln \frac{(x - x^2) f(x)}{c_1 - c_2}\, d\nu(x) + \tfrac12 \ln(c_1 - c_2) + \ln \pi$$

$$= \int_0^1 \ln\bigl[ \pi \sqrt{x - x^2}\, f(x) \bigr]\, d\nu(x) = -I(\mu). \qquad \Box$$

4.5. Proof of Theorem 2.5. The LDP for $(\tilde Q_n)$ is a direct consequence of
Theorem III.17 of den Hollander (2000). The rate function which controls
the LDP is

$$I_F(\mu) = I(\mu) - F(\mu) + K_F,$$

where $K_F := \sup_{\mu' \in P([0,1])} \{ F(\mu') - I(\mu') \}$. □